# Dense prediction
Dpt Swinv2 Base 384
MIT
A DPT (Dense Prediction Transformer) model trained on 1.4 million images for monocular depth estimation. It uses Swinv2 as its backbone network and targets high-precision depth prediction.
3D Vision
Transformers

Intel
182
0
Dpt Dinov2 Small Kitti
Apache-2.0
A DPT model that uses DINOv2 as its backbone for depth estimation.
3D Vision
Transformers

facebook
710
7
Dpt Hybrid Midas
Apache-2.0
A monocular depth estimation model based on the Vision Transformer (ViT), trained on 1.4 million images.
3D Vision
Transformers

Intel
224.05k
94
Dpt Large Ade
Apache-2.0
A Dense Prediction Transformer (DPT) model fine-tuned on the ADE20K dataset for semantic segmentation.
Image Segmentation
Transformers

Intel
3,497
8